27: Maximum Likelihood

Author

Derek Sollberger

Published

April 12, 2023

Today: Maximum Likelihood

Goal: Modify distribution parameters based on observed data

Objectives:

  • derive maximum likelihood estimate for the exponential distribution
  • derive maximum likelihood estimate for the Poisson distribution

Notation

Notation

Recall,

  • Lower-case {x1,x2,x3,...,xn} is a set of observations
  • Upper-case {X1,X2,X3,...,Xn} is a set of random variables (i.e. a data set)
  • Treating {X1,X2,...,Xn} as a set of n i.i.d. (independent and identically distributed) random variables is a common assumption.
  • With independence, P(X1,X2,...,Xn)=P(X1)P(X2)...P(Xn)
  • Each individual probability is computed (at least theoretically) with a PDF (probability density function) P(xi)=fX(xi)

Likelihood

::::: {.panel-tabset}

Definition

Likelihood Function

Let the likelihood function, in terms of a parameter θ, be the joint probability

L(θ)=P(X1=x1,X2=x2,...,Xn=xn)=fX(x1)fX(x2)fX(xn)

or

L(θ;{xi}i=1n)=i=1nfX(xi)

Maximum Likelihood

Likelihood Function

Given a data set {x1,x2,...,xn}, we seek the desired parameter(s) that makes realizing the data set most likely.

L(θ;{xi}i=1n)=i=1nfX(xi)

Maximization

From calculus, recall that the main step in maximizing the value of a function is setting the first derivative equal to zero.

simulation a better simulation

Given a data set {x1,x2,...,xn}, assume an Exp(λ) distribution.

  • Compute the value of rate parameter λ that maximizes the likelihood of the data set.
  • Compute the likelihood at the maximum likelihood estimate (MLE).
  • Characterize the top 5 percent of light bulbs.

Given a data set {x1,x2,...,xn}, assume an Pois(λ) distribution. Compute the value of parameter λ that maximizes the likelihood of the data set.

Estimators Revisited

If we sample from a theoretical U(0,M) distribution, the sample maximum sM of each sample is less than or equal to M

sMM

It would follow that the average of the sample maxima underestimates the true maximum

E[sM]M

Therefore the sample maximum is a biased estimator of the true maximum.

 

Similarly, the sample minimum sm from a U(m,0) distribution overestimates

E[sm]m

Therefore the sample min-mum is a biased estimator of the true minumum.

Looking Ahead

Final Exam will be on May 6